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Although coronaviruses are known to infect various animals by adapting to new hosts, interspecies transmission events are still 
poorly understood. During a surveillance study from 2005 to 2010, a novel alphacoronavirus, BatCoV HKU10, was detected in 
two very different bat species, Ro-BatCoV HKU10 in Leschenault’s rousettes (Rousettus leschenaulti) (fruit bats in the suborder 
Megachiroptera) in Guangdong and Hi-BatCoV HKU10 in Pomona leaf-nosed bats (Hipposideros pomona) (insectivorous bats 
in the suborder Microchiroptera) in Hong Kong. Although infected bats appeared to be healthy, Pomona leaf-nosed bats carry- 
ing Hi-BatCoV HKU10 had lower body weights than uninfected bats. To investigate possible interspecies transmission between 
the two bat species, the complete genomes of two Ro-BatCoV HKU10 and six Hi-BatCoV HKU10 strains were sequenced. Ge- 
nome and phylogenetic analyses showed that Ro-BatCoV HKU10 and Hi-BatCoV HKU10 represented a novel alphacoronavirus 
species, sharing highly similar genomes except in the genes encoding spike proteins, which had only 60.5% amino acid identities. 
Evolution of the spike protein was also rapid in Hi-BatCoV HKU10 strains from 2005 to 2006 but stabilized thereafter. Molecu- 
lar-clock analysis dated the most recent common ancestor of all BatCoV HKU10 strains to 1959 (highest posterior density re- 
gions at 95% [HPDs], 1886 to 2002) and that of Hi-BatCoV HKU10 to 1986 (HPDs, 1956 to 2004). The data suggested recent in- 
terspecies transmission from Leschenault’s rousettes to Pomona leaf-nosed bats in southern China. Notably, the rapid adaptive 
genetic change in BatCoV HKU10 spike protein by ~40% amino acid divergence after recent interspecies transmission was even 
greater than the ~20% amino acid divergence between spike proteins of severe acute respiratory syndrome-related Rhinolophus 
bat coronavirus (SARSr-CoV) in bats and civets. This study provided the first evidence for interspecies transmission of coronavi- 
rus between bats of different suborders. 


 Eeuabes (CoVs) infect a wide variety of animals, causing 
respiratory, enteric, hepatic, and neurological diseases of 
varying severity. Traditionally, CoVs have been classified into 
groups 1, 2, and 3, based on genotypic and serological character- 
istics (29, 79). Recently, the nomenclature and taxonomy of CoVs 
were revised by the Coronavirus Study Group of the International 
Committee for Taxonomy of Viruses (ICTV). CoVs are now clas- 
sified into three genera, Alphacoronavirus, Betacoronavirus, and 
Gammacoronavirus, which replace the three traditional groups 
(5). Novel CoVs, which represented a novel genus, Deltacoronavi- 
rus, have also been identified (72, 73). While CoVs from all four 
genera can be found in mammals, bat CoVs are likely the gene 
source of Alphacoronavirus and Betacoronavirus, and avian CoVs 
are the gene source of Gammacoronavirus and Deltacoronavirus (9, 
41, 73). CoVs are known to possess high frequency of recombina- 
tion and mutation rates, which may allow them to adapt to new 
hosts and ecological niches (21, 29, 35, 68, 71, 78). 

The severe acute respiratory syndrome (SARS) epidemic, 
caused by SARS CoV (SARS-CoV) (17, 27, 43), has boosted inter- 
est in the discovery of novel CoVs in both humans and animals 
(12, 20, 36, 41, 63, 65, 66, 72). In particular, a previously unknown 
variety of CoVs have been identified in bats from China and other 
countries, including SARS-related Rhinolophus bat CoVs (SARSr- 
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Rh-BatCoVs) in horseshoe bats, suggesting that bats are impor- 
tant reservoirs of CoVs (8, 13, 30, 31, 33, 40, 49, 59, 67, 70). How- 
ever, our understanding of the diversity, evolution, and 
interspecies transmission of CoVs in animals is still limited. For 
example, it remains unknown if bats are the direct origin of SARS- 
CoV in civets and humans, as the spike (S) protein of SARSr-Rh- 
BatCoV possesses only ~80% amino acid identity to that of civet 
SARSr-CoV, with significant differences in the receptor binding 
domain (30, 32, 40, 51). 

During a continuous surveillance study, in an attempt to better 
understand the role of bats in the evolution of CoVs and search for 
other bat species which may have served as intermediate hosts for 
interspecies transmission of SARSr-CoVs, a potentially novel al- 
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phacoronavirus, BatCoV HKU 10, was detected in two very differ- 
ent bat species. After its first detection in a Leschenault’s rousette 
in Guangdong (70), the virus was also found in Pomona leaf- 
nosed bats in Hong Kong. In the present study, the epidemiology 
of BatCoV HKU10 in different bat species was determined. To 
investigate possible interspecies transmission events, complete ge- 
nome sequencing and analysis of eight BatCoV HKU1O0 strains 
from the two bat species was performed. The results revealed that 
viruses from the two bat species were highly similar, except for 
their S proteins, which shared only ~60% amino acid identities. 
Positive selection and molecular-clock analysis showed that inter- 
species transmission of BatCoV HKU10 from Leschenault’s rou- 
settes in Guangdong to Pomona leaf-nosed bats in Hong Kong is 
likely to have occurred recently, with rapid evolution of the S 
protein in the latter bat species. 


MATERIALS AND METHODS 


Collection of bat samples. Bats of various species were captured from dif- 
ferent locations in Hong Kong and in the Guangdong province of southern 
China over a 5-year period (September 2005 to August 2010). Respiratory 
and alimentary specimens were collected using procedures described previ- 
ously (30, 77). To prevent cross contamination, specimens were collected 
using disposable swabs with protective gloves, which were changed between 
samples. All specimens were immediately placed in viral transport medium 
before transportation to the laboratory for RNA extraction. 

RNA extraction. Viral RNA was extracted from the respiratory and 
alimentary specimens using a QIAamp viral RNA minikit (Qiagen, 
Hilden, Germany). The RNA was eluted in 50 wl of AVE buffer and was 
used as the template for reverse transcription-PCR (RT-PCR). 

RT-PCR for CoVs and DNA sequencing. CoV detection was per- 
formed by amplifying a 440-bp fragment of the RNA-dependent RNA 
polymerase (RdRp) gene of CoVs using conserved primers (5'-GGTTGG 
GACTATCCTAAGTGTGA-3’ and 5’-CCATCATCAGATAGAATCATC 
ATA-3’) designed by multiple alignments of the nucleotide sequences of 
available RdRp genes of known CoVs as described previously (32, 66). 
Reverse transcription was performed using a SuperScript III kit (Invitro- 
gen, San Diego, CA). The PCR mixture (25 pl) contained cDNA, PCR 
buffer (10 mM Tris-HCl [pH 8.3], 50 mM KCl, 3 mM MgCL, and 0.01% 
gelatin), a 200 4M concentration of each deoxynucleoside triphosphate 
(dNTP), and 1.0 U Taq polymerase (Applied Biosystems, Foster City, 
CA). The mixtures were amplified with 60 cycles of 94°C for 1 min, 48°C 
for 1 min, and 72°C for 1 min and a final extension at 72°C for 10 min in 
an automated thermal cycler (Applied Biosystems, Foster City, CA). Stan- 
dard precautions were taken to avoid PCR contamination, and no false 
positives were observed in negative controls. 

The PCR products were gel purified using a QIAquick gel extraction 
kit (Qiagen, Hilden, Germany). Both strands of the PCR products were 
sequenced twice with an ABI Prism 3700 DNA analyzer (Applied Biosys- 
tems, Foster City, CA), using the two PCR primers. The sequences of the 
PCR products were compared with known sequences of the RdRp genes of 
CoVs in the GenBank database. Phylogenetic tree construction was per- 
formed using neighbor-joining method with ClustalX 1.83. 

Statistical analysis. Comparison of body weights of bats between dif- 
ferent groups was performed using Student’s t test and covariate analysis 
(SPSS version 11.5). A P value of <0.05 was regarded as statistically sig- 
nificant. 

Viral culture. Four samples positive for BatCoV HKU10 were cultured 
in FRhK-4 (rhesus monkey kidney; ATCC CRL-1688), Vero E6 (African 
green monkey kidney; ATCC CRL-1586), and HRT-18G (human colorectal 
adenocarcinoma; ATCC CRL-11663) cell lines and primary bat kidney and 
lung fibroblast cells derived from a Chinese horseshoe bat. 

Complete genome sequencing of Ro-BatCoV HKU10 and Hi- 
BatCoV HKU10. Six complete genomes of Hipposideros bat CoV HKU10 
(Hi-BatCoV HKU10) and two complete genomes of Rousettus bat CoV 
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HKU10 (Ro-BatCoV HKU10) detected in the present study were ampli- 
fied and sequenced using the RNA directly extracted from the alimentary 
specimens as templates according to previously described strategies (31, 
32). The RNA was converted to cDNA by a combined random-priming 
and oligo(dT) priming strategy. As the initial results revealed that they 
belong to Alphacoronavirus, the cDNA was amplified by degenerate prim- 
ers designed by a multiple alignment of the genomes of human CoV 229E 
(HCoV 229E) (GenBank accession no. NC_002645), porcine epidemic 
diarrhea virus (PEDV) (GenBank accession no. NC_003436), porcine 
transmissible gastroenteritis virus (TGEV) (GenBank accession no. 
NC_002306), feline infectious peritonitis virus (FIPV) (GenBank acces- 
sion no. AY994055), HCoV NL63 (GenBank accession no. NC_005831), 
and Rhinolophus bat CoV HKU2 (Rh-BatCoV HKU2) (GenBank acces- 
sion no. EF203067), and additional primers covering the original degen- 
erate primer sites were designed from the results of the first and subse- 
quent rounds of sequencing. These primer sequences are shown in Tables 
S1 and 82 in the supplemental material. The 5’ ends of the viral genomes 
were confirmed by rapid amplification of CDNA ends (RACE) using the 
5'-3' RACE kit (Roche, Germany). Sequences were assembled and man- 
ually edited to produce final sequences of the viral genomes. For the other 
positive samples not included in complete genome sequencing, additional 
PCR targeted to other genome sites, including partial fragments of the 
helicase (Hel) and S genes, was also performed using the genome sequenc- 
ing primers to exclude false positives due to PCR contamination. 

Genome analysis. The nucleotide sequences of the genomes and the 
deduced amino acid sequences of the open reading frames (ORFs) were 
compared to those of other CoVs. Phylogenetic trees were constructed 
using the maximum-likelihood method (18), with bootstrap values cal- 
culated from 100 trees. Protein family analysis was performed using 
PFAM and InterProScan (1, 2). Prediction of transmembrane domains 
was performed using TMHMM (55). 

Sequencing of the complete RdRp and S genes of Ro-BatCoV 
HKU10 and Hi-BatCoV HKU10 strains. To allow more accurate analysis 
of positive selection and divergence time, the complete RdRp genes of 25 
Hi-BatCoV HKU10 strains and the complete S genes of one Ro-BatCoV 
HKU10 and four Hi-BatCoV HKU1O0 strains, in addition to the eight 
strains with complete genome sequences, were amplified and sequenced 
using primers available from genome sequencing as described above. The 
sequences of the PCR products were assembled manually to produce the 
complete RdRp and S gene sequences. 

Estimation of synonymous- and nonsynonymous-substitution 
rates. The number of synonymous substitutions per synonymous site, K,, 
and the number of nonsynonymous substitutions per nonsynonymous 
site, K,, for each coding region were calculated using the Nei-Gojobori 
method (Jukes-Cantor) in MEGA version 5 (57). 

Recombination analysis and detection of positive selection. Recom- 
bination detection was performed among genomes of BatCoV HKU10 
strains using bootscan analysis and genetic algorithm recombination de- 
tection (GARD) with the Kishino-Hasegawa (KH) test as described pre- 
viously (25, 32, 36, 47). While bootscan analysis is a widely used tool for 
detection of recombination with the window size having strong influence 
on recombination inference, GARD is often used to accurately locate the 
recombination breakpoints and determine the level of statistical signifi- 
cance (25). For estimation of positive selection, BatCoV HKU10 strains 
were grouped based on the year of sampling and the host species after 
removal of duplicates: Ro-BatCoV HKU10, Hi-BatCoV HKU10 from 
2005-2006, Hi-BatCoV HKU10 from 2007-2008, Hi-BatCoV HKU10 
from 2005-2008, and Hi-BatCoV HKU10 from 2005-2010. Sites under 
positive selection in the S gene were inferred using single-likelihood an- 
cestor counting (SLAC), fixed-effects likelihood (FEL), and random- 
effects likelihood (REL) methods as implemented in the DataMonkey 
server (http://www.datamonkey.org) (48). Positive selection for a site was 
considered to be statistically significant if the P value was <0.1 for the 
SLAC and FEL methods or posterior probability was =90% for the REL 
method. An unrestricted random-effects branch site model, branch site 


jviiasm.org 11907 


AINN GNVTMVO Aq S102 ‘9 Id uo /Bio"wse"IAl//:dyyy Wo. pepeojumoq 


Lau et al. 


TABLE 1 Detection of Ro-BatCoV HKU 10 and Hi-BatCoV HKU10 in bats by RT-PCR 


Bat 


Suborder, family, and specific name 


Common name 


No. (%) of bats positive for BatCoV HKU10 in: 


No. of bats tested Respiratory samples Alimentary samples 


Megachiroptera 
Pteropodidae 
Cynopterus sphinx 
Rousettus leschenaulti 


Microchiroptera 
Hipposideridae 
Hipposideros armiger 
Hipposideros larvatus 
Hipposideros pomona 


Rhinolophidae 
Rhinolophus affinus 
Rhinolophus osgoodi 
Rhinolophus pusillus 
Rhinolophus sinicus 


Vespertilionidae 
Hypsugo pulveratus 
Miniopterus magnater 
Miniopterus pusillus 
Miniopterus schreibersii 
Myotis chinensis 
Myotis horsfieldii 
Myotis muricola 
Myotis ricketti 
Nyctalus noctula 
Pipistrellus abramus 
Pipistrellus tenuis 
Scotophilus kuhlii 


Short-nosed fruit bat 
Leschenault’s rousette 


Himalayan leaf-nosed bat 
Intermediate leaf-nosed bat 
Pomona leaf-nosed bat 


Intermediate horseshoe bat 
Osgood’s horseshoe bat 
Least horseshoe bat 
Chinese horseshoe bat 


Chinese pipistrelle 
Greater bent-winged bat 
Lesser bent-winged bat 
Common bent-winged bat 
Chinese myotis 
Horsfield’s bat 
Whiskered myotis 
Rickett’s big-footed bat 
Brown noctule 

Japanese pipistrelle 
Least pipistrelle 

Lesser yellow bat 


Tylonycteris pachypus Lesser bamboo bat 


24 0 (0) 0 (0) 
416 0 (0) 3 (0.7)" 
207 0 (0) 0 (0) 
2 0 (0) 0 (0) 
524 3 (0.6) 36 (7.2) 
339 0 (0) 0 (0) 
1 0 (0) 0 (0) 
83 0 (0) 0 (0) 
1,671 0 (0) 0 (0) 
1 0 (0) 0 (0) 
14 0 (0) 0 (0) 
380 0 (0) 0 (0) 
525 0 (0) 0 (0) 
86 0 (0) 0 (0) 
7 0 (0) 0 (0) 
3 0 (0) 0 (0) 
175 0 (0) 0 (0) 
38 0 (0) 0 (0) 
198 0 (0) 0 (0) 
11 0 (0) 0 (0) 
16 0 (0) 0 (0) 
75 0 (0) 0 (0) 


* Ro-BatCoV HKU10 was detected in three (0.8%) of 350 Leschenault’s rousette bats in Guangdong but none of 66 Leschenault’s rousette bats in Hong Kong. 
» Hi-BatCoV HKU1O0 was detected in 37 (7%) of 523 Pomona leaf-nosed bats in Hong Kong but not in one Pomona leaf-nosed bat in Guangdong. 


REL, was implemented for detecting lineage-specific selection (26). This 
method is usually used to identify branches in a tree with evidence of 
episodic diversifying selection and is known to be more robust to errors 
because it does not enforce uniform selective pressure on all background 
branches (26). 

Estimation of divergence time. Divergence time was calculated using 
RdRp gene sequence data of Hi-BatCoV HKU 10 and Ro-BatCoV HKU10 
strains and the Bayesian Markov chain Monte Carlo (MCMC) approach 
as implemented in BEAST (version 1.6.2) as described previously (10, 32, 
35, 36). One parametric model (constant size) and one nonparametric 
model (Bayesian Skyline with five groups) for tree priors were used for the 
inference. Analyses were performed with the SRD06 substitution model 
using both strict and relaxed [uncorrelated lognormal (Ucld) and uncor- 
related exponential (Uced)] molecular clocks. The MCMC run was 1 X 
108 steps long, with sampling every 1,000 steps. Convergence was assessed 
on the basis of the effective sampling size after a 10% burn-in using Tracer 
software version 1.5 (10). The mean time of the most recent common 
ancestor (tMRCA) and the highest posterior density regions at 95% 
(HPD) were calculated, and the best-fitting model was selected by a Bayes 
factor, using marginal likelihoods implemented in Tracer (56). Bayesian 
Skyline under a relaxed-clock model with Uced was adopted for making 
inferences, as Bayes factor analysis indicated that this model fitted the data 
better than other models tested (data not shown). The trees were summa- 
rized in a target tree by the Tree Annotator program included in the 
BEAST package by choosing the tree with the maximum sum of posterior 
probabilities (maximum clade credibility) after a 10% burn-in. 
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Nucleotide sequence accession numbers. The nucleotide sequences 
of the eight genomes of BatCoV HKU10 have been deposited in the 
GenBank sequence database under accession no. JQ989266 to JQ989273. 


RESULTS 

Detection of a novel alphacoronavirus in Leschenault’s rou- 
settes and Pomona leaf-nosed bats. A total of 9,443 respiratory 
and alimentary specimens from 4,796 bats of 22 species were ob- 
tained in Hong Kong and Guangdong Province in southern 
China. RT-PCRs for a 440-bp fragment in the RdRp genes of CoVs 
were positive for the potentially novel alphacoronavirus BatCoV 
HKU10 in the alimentary samples from three (0.7%) of 416 Le- 
schenault’s rousettes (Rousettus leschenaulti) and in the alimen- 
tary and respiratory samples from 36 (7.2%) and 3 (0.6%) of 524 
Pomona leaf-nosed bats (Hipposideros pomona), respectively (Ta- 
ble 1). Sequencing of the PCR products showed that these viral 
sequences formed a separate cluster distinct from known CoVs 
upon phylogenetic analysis, with <82% nucleotide identities to 
the corresponding sequences of Rh-BatCoV A977 (GenBank ac- 
cession no. DQ648855). All positive samples were confirmed by 
RT-PCR of multiple genome sites using primers targeted to Hel or 
S genes. All Leschenault’s rousettes positive for Ro-BatCoV 
HKU10 were from Guangdong Province, and all Pomona leaf- 
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TABLE 2 Coding potential and putative transcription regulatory sequences of Ro-BatCoV HKU10 and Hi-BatCoV HKU10 


Putative TRS 
No. of No. of Nucleotide position 
Coronavirus ORF Nucleotide position nucleotides aminoacids Frame in genome TRS sequence* 
Ro-BatCoV HKU10 183A lab 303-20644 20,342 6,780 es re 74 CUAAAC(220)AUG 
S 20641-24690 4,050 1,349 + 20628 CUAAAC(4)AUG 
NS3 24690-25346 657 218 +3 24655 
E 25375-25602 228 75 + 25353 CUAAAU(13)AUG 
M 25608-26297 690 229 +3) 25596 CUAAAC(3)AUG 
N 26308-27516 1,209 402 2, 26296 CUAAAC(4)AUG 
NS7a 27532-27777 246 81 + 27518 CUAAAC(5)AUG 
NS7b 27787-28248 462 153 + 
NS7c 27986-28216 231 76 +2 
Hi-BatCoV HKU10 TLC1310A lab 303—20647 20,345 6,781 +3, +2 74 CUAAAC(220)AUG 
S 20644-24699 4,056 1,351 + 20631 CUAAAC(4)AUG 
NS3 24699-25355 657 218 +3 24664 
E 25384-25611 228 75 + 25362 CUAAAU(13)AUG 
M 25617-26297 681 226 +3 25605 CUAAAC(3)AUG 
N 26309-27508 1,200 399 +2 26296 CUAAAC(4)AUG 
NS7a 27524-27766 243 80 2. 27510 CUAAAC(5)AUG 
NS7b 27776-28237 462 153 +2 
NS7c 27975-28205 231 76 +3 


* The number in parentheses is the number of nucleotides between the TRS and start codon. 


nosed bats positive for Hi-BatCoV HKU10 were from 6 of 15 
sampling locations in Hong Kong (Table 1). 

No obvious disease was observed in bats positive for Ro- 
BatCoV HKU10 or Hi-BatCoV HKU10. However, lower body 
weights were observed in Pomona leaf-nosed bats positive for Hi- 
BatCoV HKU10 (body weight [mean + standard deviation], 
6.67 + 0.4 g) than those negative for CoVs (6.95 + 0.8 g) (P = 
0.038 by Student’s t test). Since all 37 infected Pomona leaf-nosed 
bats were adults (juvenile and adult bats are differentiated by their 
fur color and finger joints), comparison was also performed using 
only adult Pomona leaf-nosed bats negative for CoVs (body 
weight, 7.00 + 0.8 g) (P = 0.016 by Student’s f test). To control for 
the confounding effect of variation in body weights in different 
seasons, e.g., after hibernation, covariate analysis was performed 
using only data from the months with positive detection (March, 
August, October, November, and December). Results showed that 
Hi-BatCoV HKU10 carriage was an independent factor in associ- 
ation with lower body weights (P = 0.016). Attempts to stably 
passage BatCoV HKU10 in cell cultures were unsuccessful, with 
no cytopathic effect or viral replication being detected. 

Complete genome characterization of Ro-BatCoV HKU10 
and Hi-BatCoV HKU1O0. Since the partial RdRp sequences sug- 
gested the presence of closely related viruses belonging to a poten- 
tially novel alphacoronavirus in two bat species, the complete ge- 
nome sequences of two strains of Ro-BatCoV HKU10, 175A and 
183A (from alimentary samples of two Leschenault’s rousettes), 
and six strains of Hi-BatCoV HKU10, TLC1310A, TLC1347A, 
TLC1343A, TT3A, SL12A, and LSH5A (from alimentary samples 
of six Pomona leaf-nosed bats), were determined to look for 
genomic differences between viruses from the two bat species and 
evidence of interspecies transmission. The eight genomes pos- 
sessed genome sizes of 28,483 to 28,494 nucleotides, with a G+C 
content of 38% to 39%. The two genomes of Ro-BatCoV HKU10 
from Leschenault’s rousettes had 99% overall nucleotide identity, 
while the six genomes of Hi-BatCoV HKU10 from Pomona leaf- 
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nosed bats had 99% overall nucleotide identity. On the other 
hand, comparison between Ro-BatCoV HKU10 and Hi-BatCoV 
HKU10 genomes showed only 93 to 97% nucleotide identity. 
Their genome organization was similar to that of other alphacoro- 
naviruses (Table 2; Fig. 1). In both Ro-BatCoV HKU10 and Hi- 
BatCoV HKU10 genomes, a putative transcription regulatory se- 
quence (TRS) motif, 5’-CUAAAC-3’, similar to that in other 
alphacoronaviruses was identified at the 3’ end of the leader se- 
quence and precedes each ORF except the NS3 and envelope (E) 
genes (Table 2) (11, 22). Preceding the E gene, an alternative TRS 
motif, 5’-CUAAAU-3’, was also identified in both the Ro-BatCoV 
HKU10 and Hi-BatCoV HKU10 genomes (Table 2). 

The characteristics of putative nonstructural proteins (NSPs) 
of ORF1 of Ro-BatCoV HKU10 and Hi-BatCoV HKU 10 and their 
predicted cleavage sites are summarized in Table 3. A unique pu- 
tative cleavage site at NSP10/11 or NSP10/12 was observed in both 
Ro-BatCoV HKU10 and Hi-BatCoV HKU10, where the P1’ posi- 
tion was occupied by alanine instead of serine or glycine as in 
other alphacoronaviruses. This amino acid substitution was due 
to mutations from TC(A/T), AG(T/C), or GGC to GCT in Ro- 
BatCoV HKU10 and Hi-BatCoV HKU10. 

One ORF, which encodes a putative 218-aa nonstructural pro- 
tein, NS3, was observed between the S and E genes of Ro-BatCoV 
HKU10 and Hi-BatCoV HKU10. This NS3, which is highly con- 
served among different strains of Ro-BatCoV HKU10 and Hi- 
BatCoV HKU10 with =98.2% amino acid identities, possessed 
only =47% amino acid identities to NS3 of Mi-BatCoV HKU8 
and other alphacoronaviruses. TMHMM analysis showed three 
putative transmembrane domains in NS3 of Ro-BatCoV HKU10 
strain 175A and all six Hi-BatCoV HKU10 strains (at residues 33 
to 53, 62 to 82, and 88 to 106, respectively), while only two puta- 
tive transmembrane domains were observed in NS3 of Ro- 
BatCoV HKU10 strain 183A (at residues 33 to 53 and 74 to 96, 
respectively). 

The most striking difference between Ro-BatCoV HKU10 and 
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FIG 1 Genome organizations of Ro-BatCoV HKU10, Hi-BatCoV HKU10, and representative CoVs from each group. Genes for papain-like proteases (PL1?"°, 
PL2?"°, and PL?*®), 3C-like protease (3CL?"°), and RNA-dependent RNA polymerase (RdRp) are represented by orange boxes. Genes for hemagglutinin esterase 
(HE), spike protein (S), envelope protein (E), membrane protein (M), and nucleocapsid protein (N) are represented by green boxes. Genes for putative accessory 
proteins are represented by blue boxes. BatCoV HKU10 strains detected in this study are shown in bold. 


Hi-BatCoV HKU10 genomes was observed in their S proteins, 
which consisted of 1,349 to 1,351 aa. In contrast to products of 
other regions of the genome, such as 3C-like protease (3CLP"°), 
RdRp, Hel, E, membrane (M), and nucleocapsid (N) proteins, 
where they possessed high sequence similarity (>96% amino acid 
identities), the S proteins of Ro-BatCoV HKU10 and Hi-BatCoV 
HKU10 shared only about 60.5% amino acid identities, as a result 
of frequent amino acid substitutions observed throughout their 
S-protein sequences (Table 4; also, see Fig. $1 in the supplemental 
material). The S protein of Ro-BatCoV HKU10 and Hi-BatCoV 
HKU10 shared <52% amino acid identity to the S proteins of 
other alphacoronaviruses (Table 4). As in other alphacoronavi- 
ruses (6), no cleavage site was identified between S1 and S2. Inter- 
ProScan analysis predicted them as type I membrane glycopro- 
teins, with most of the protein (residues 23/24/28 to 1292/1294) 
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exposed on the outside of the virus and with a transmembrane 
domain (residues 1293/1295 to 1327/1329) at the C terminus, 
followed by a cytoplasmic tail rich in cysteine residues. Two hep- 
tad repeats (HR), important for membrane fusion and viral entry 
(4), were located at residues 959 to 1085/1086 (HR1) and 1234 to 
1284 (HR2) for Ro-BatCoV HKU10 and at residues 952/955/959 
to 1048/1051/1053 (HR1) and 1235/1237 to 1284/1286 (HR2) for 
Hi-BatCoV HKU10. Aminopeptidase N (CD13) has been shown 
to be the receptor for various alphacoronaviruses, including 
HCoV 229E, canine CoV (CCoV), FIPV, PEDV, and TGEV (7, 
75). On the other hand, human angiotensin-converting enzyme 2 
(hACE2) has been found to be the receptor for both HCoV NL63, 
an alphacoronavirus, and SARS-CoV, a betacoronavirus, al- 
though they utilize different receptor-binding sites (23, 38). The S 
proteins of Ro-BatCoV HKU10 and Hi-BatCoV HKU10 did not 
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TABLE 3 Characteristics of putative nonstructural proteins of ORF lab 
in Ro-BatCoV HKU10 and Hi-BatCoV HKU10 


Amino acids? 


Putative function or Ro-BatCoV HKU10 Hi-BatCoV HKU10 


NSP — domain* (183A) (LSH5A) 
NSP1 Unknown M?!-A!> M!-A19S 
NSP2 Unknown KG aes K196_G888 
NSP3  ADRP, Putative PLP*° 5889_G?518 5889_G2519 
domain PL1?*°, PL2P"° 

NSP4 Hydrophobic domain S21220777° §2520_(2997 
NSP5 = 3CLPr° 52997 _. 93298 §2998_93299 
NSP6 Hydrophobic domain $2722 0227" $3300_(93575 
NSP7 Unknown §2573_(Q3657 53576_Q3658 
NSP8 Unknown §3658_(3852 53659_(Q3853 
NSP9 Unknown N2853_(g3960 N3854_Q3961 


NSP10 Unknown A3961_(94097 A3962_ 4098 
NSP11 Unknown (short peptide A*°°°-N*!1° A4099_yy4116 
at the end of ORFla) 
NSP12 RdRp A4998_ 9024 A4°99.Q50?5 
NSP13 Hel $5025_Q5621 $5026_(5622 
NSP14_ ExoN, N7-MTase y, Nabe © lancet A3623_Q6140 
NSP15 NendoU go140_Q6478 $6141_Q6479 
NSP16 2’-O-MT §6479_ 6780 56480_p6781 


* ADRP, ADP-ribose-1”-phosphatase; PL?’®, papain-like protease; 3CLP’®, 3C-like 
protease; RdRp, RNA-dependent RNA polymerase; Hel, helicase; ExoN, 3’-to-5’ 
exonuclease; N7-MTase, (guanine-N7)-methyltransferase; NendoU, nidoviral 
uridylate-specific endoribonuclease; and 2’-O-MT, 2'-O-ribose methyltransferase. 

» Given in the format first residueP?°°"-last residueP°!°", The alanine at the P1' 
position of the unique putative cleavage site at NSP10/11 or NSP10/12 is shown in bold. 


exhibit significant homology to the known receptor-binding do- 
mains of other CoVs, including HCoV 229E (3, 24, 28, 44, 74). 

Downstream of the N gene, both Ro-BatCoV HKU 10 and Hi- 
BatCoV HKU10 genomes (except those of strains TT3A and 
SL12A) possess three ORFs encoding nonstructural proteins 
NS7a, NS7b, and NS7c, of 80 or 81, 153, and 76 aa, respectively. 
Strains TT3A and SL12A possess NS7b and NS7c but not NS7a, as 
a result of a nucleotide substitution in the start codon of NS7a 
(ATG to ATT). And since Ro-BatCoV HKU10 and Hi-BatCoV 
HKU10 share only 60% amino acid identity in NS7a to other 
strains, this gene may be nonfunctional. In contrast, NS7b and 
NS7c were highly similar between the two viruses, sharing 92 to 
95% and 88 to 90% amino acid identities, respectively. However, a 
BLAST search revealed no significant amino acid similarities be- 
tween these putative nonstructural proteins and other known 
proteins. TMHMM analysis showed two putative transmembrane 
domains in NS7a (at residues 5 to 23 and 42 to 62/76) but none in 
NS7b. For NS7c, one putative transmembrane domain was ob- 
served (at residues 31 to 51) in all strains except Ro-BatCoV 
HKU10 strain 175A, which possessed no putative transmembrane 
domain in its NS7c. Some alphacoronaviruses, such as FIPV, 
TGEV, porcine respiratory CoV (PRCV), Rh-BatCoV HKU2, and 
Sc-BatCoV 512, are also known to possess genes downstream of 
that for N (Fig. 1). In FIPV, the two genes downstream of the N 
gene may be important for virulence, while in TGEV, the gene 
downstream of the N gene may play a role in membrane associa- 
tion of replication complexes or virus assembly (19, 42, 62). Fur- 
ther experiments will delineate the function of such ORFs in bat 
CoVs. 

Phylogenetic analyses. The phylogenetic trees constructed us- 
ing the amino acid sequences of the RdRp, Hel, S, and N proteins 
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of Ro-BatCoV HKU10, Hi-BatCoV HKU10, and other CoVs are 
shown in Fig. 2, and the corresponding pairwise amino acid iden- 
tities are shown in Table 4. For Hel, RdRp, and N genes, the two 
strains of Ro-BatCoV HKU10 and six strains of Hi-BatCoV 
HKU10 clustered together with very short branch lengths, reflect- 
ing their high sequence similarities (Fig. 2). Moreover, compari- 
son of the amino acid sequences of the seven conserved replicase 
domains or NSPs {ADP-ribose-1”-phosphatase, ADRP, NSP5 
(3CLP*°), NSP12 (RdRp), NSP13 (Hel), NSP14 [3’-to-5’ exonu- 
clease, ExoN; (guanine-N7)-methyltransferase, N7-MTase], 
NSP15 (nidoviral uridylate-specific endoribonuclease, NendoU) 
and NSP16 (2'-O-ribose methyltransferase, 2’-O-MT)} for CoV 
species demarcation (5) showed that Ro-BatCoV HKU10 and Hi- 
BatCoV HKU10 possess <90% amino acid identities to those of 
other alphacoronaviruses but >90% amino acid identities to each 
other, indicating that they represented the same novel species of 
Alphacoronavirus. 

In contrast, marked sequence divergence was observed between 
the S proteins of Ro-BatCoV HKU10 and Hi-BatCoV HKU10 
strains, forming two distinct clusters upon phylogenetic analysis, 
which was confirmed by further sequencing of the S genes of one 
additional Ro-BatCoV HKU10 and four additional Hi-BatCoV 
HKU 10 strains (Table 4; Fig. 2). Moreover, among the 10 Hi-BatCoV 
HKU10 strains with available S-gene sequences, two strains, TLC43A 
and TLC47A, both detected in 2010, appeared to form a distinct clus- 
ter, sharing ~95% amino acid identities to the other eight strains, 
with most of the substitutions being nonsynonymous substitutions 
localized within the S1 region. 

Estimation of synonymous and nonsynonymous substitu- 
tion rates. As demonstrated in studies on the evolution and cross- 
species transmission of SARS-CoV-like viruses, high K,/K, ratios 
and substantial changes in the spike proteins of coronaviruses may 
reflect rapid viral evolution soon after introduction into a new 
animal host (54). Since results from genome analysis suggested 
that Ro-BatCoV HKU10 and Hi-BatCoV HKU10 possess highly 
similar genome sequences except in the S genes, we hypothesize 
that interspecies transmission between the two bat species oc- 
curred recently, with subsequent viral adaptation in the new host 
species. To test this hypothesis, the K,/K, ratios for the various 
coding regions in different strains of Ro-BatCoV HKU10 and 
Hi-BatCoV HKU10 were determined (Table 5). Compared to 
Ro-BatCoV HKU10 strains, higher K,/K, ratios were observed 
in S (0.277 versus 0.078) and N (0.235 versus 0.077) genes of 
Hi-BatCoV HKU10 strains. Interestingly, when the K,/K, ra- 
tios of S genes of additional strains from different sampling 
times were compared, a dramatic reduction of K,/K, ratios, 
from 2.000 among four Hi-BatCoV HKU10 strains from 2005- 
2006 to 0.333 among four Hi-BatCoV HKU10 strains from 
2007-2008 and to 0 among two Hi-BatCoV HKU10 strains 
from 2010, was observed, compared to 0.070 among three Ro- 
BatCoV HKU10 strains from 2005. This suggests that the S 
gene of Hi-BatCoV HKU10 from 2005-2006 likely underwent 
rapid evolution under positive selection in Pomona leaf-nosed 
bats and that this evolution stabilized thereafter. 

Recombination analysis and detection of positive selection. 
No significant recombination breakpoint among BatCoV HKU10 
genomes was detected by bootscan or GARD analysis. Significant 
positive selection was predicted by the REL method but not the 
SLAC and FEL methods, as the REL method is more powerful, 
since it pools signals from multiple sites to detect selection. Branch 


jviiasm.org 11911 


AINN GNVTVO Aq S102 ‘9 IWdy uo /Bio"wse"lAl//:dyyy Wo. pepeojumoq 


Lau et al. 


TABLE 4 Comparison of genome sizes and amino acid identities between predicted proteins of Ro-BatCoV HKU10, Hi-BatCoV HKU10, and other 
CoVs 


Pairwise amino acid identity (%) 


: Ro-BatCoV HKU10 183A Hi-BatCoV HKU10 LSH5A 
Genoese eee seen see 
Coronavirus* (bases) 3CLP*° RdRp Hel S E M N 3CLP*° RdRp Hel S E M N 
Alphacoronavirus 
TGEV 28,586 63.6 75 77 42.8 32.9 47 36.7 63.2 74.7 76.8 42.55 32.9 44.9 36.7 
FIPV 29,355 62.6 75.2 77 42.8 32.9 44.9 33.3 62.3 74.9 76.8 42.8 32.9 42.6 35.3 
PRCV 27,550 63.6 74.8 76.6 41.1 36.6 47.1 36.1 63.2 74.5 76.5 41.1 366 45.4 36.3 
CCoV 29,363 61.3 74.9 77.5 42.66 35.4 44.8 36.5 60.9 74.6 77.3 42.1 35.4 42.9 36.9 
MCoV 28,894 58.9 74.0 76.5 43.3 31.7 45.4 34.7 59.3 73.8 76.4 42.1 31.7 446 34.4 
HCoV 229E 27,317 67.9 79.3 82.9 42.8 468 60.7 42.4 67.5 79.2 82.7 43.2 468 59.7 42.3 
HCoV NL63 27,553 65.7 82.1 84.8 463 49.4 65.5 43.1 65.3 81.9 84.6 45.3 49.4 66.8 43.5 
PEDV 28,033 74.5 82.5 86.3 449 51.9 72.9 43.8 74.2 82.2 86.1 46.8 51.9 73 43.2 
Rh-BatCoV HKU2 27,165 62.3 79.9 80.1 25.6 60 68.1 47 61.9 79.8 79.9 25.2 60 69 46.8 
Mi-BatCoV 1A 28,326 70.5 83.4 84.3 47.4 52 65.9 47.8 70.2 83.3 84.1 47.55 52 64.7 47.8 
Mi-BatCoV 1B 28,476 70.2 82.6 84.3 45.7 52 64.3 48.1 69.9 82.5 84.1 45.1 52 64.7 48.3 
Mi-BatCoV HKU8 28,773 74.5 84.1 86.1 51.4 50.7 67.7 46.3 74.5 83.8 85.9 51.2 50.7 66.9 45.8 
Sc-BatCoV 512 28,203 70.9 80.7 82.6 46.3 53.2 69.9 46.7 71.2 80.4 82.4 45.55 53.2 69.6 46.9 
Ro-BatCoV HKU10 28,494 99.7 99.5 99.8 60.5 100 96.1 97 
183A 
Hi-BatCoV HKU10 28,492 99.7 99.5 99.8 60.5 100 96.1 97 
LSH5A 
Betacoronavirus subgroup 
A 
HCoV OC43 30,738 44,2 57 56.9 27 30.2 35.3 25.9 44.2 56.9 56.8 27 30.2. 35.5 25.7 
BCoV 31,028 43.9 56.9 57.1 27.4 30.2 36.1 24.9 43.9 56.7 56.9 25.4 30.2 36.1 25.1 
PHEV 30,480 43.6 56.9 SAL 27 30.2 34.2 26.5 43.6 56.7 56.9 25.9 30.2 34.3 25.7 
GiCoV 30,979 43.9 56.9 57.1 27.4 30.2 36.1 25.5 43.9 56.7 56.9 25.5 30.2 36.1 25.3 
MHV 31,357 45.2 56.4 56.9 266 32.5 36.8 25 45.2 56.5 56.7. 25.7. 32.5 37.2 24.7 
HCoV HKUI1 29,926 44.6 56.5 55.6 269 31 36.7. 26.7 44.2 56.1 55.4 263 31 38 27.1 
Betacoronavirus subgroup 
B 
SARS-CoV 29,751 44.8 59.1 62 26.55 21.1 32.8 26.9 45.1 59 61.8 25.6 21.1 31.7 26.4 
SARSr-Rh-BatCoV 29,728 44.1 59.1 61.6 26 21.1 32.3 27.1 44.4 59 61 25.1 211. 313 273 
HKU3 
Betacoronavirus subgroup 
Cc 
Ty-BatCoV HKU4 30,286 45.3 59.9 62.5 27.8 244 35.7 26.5 45 59.8 62.3 26.7 244 36 26.9 
Pi-BatCoV HKU5 30,488 45.9 59.4 63:5. 25.8 23.2 33 26.7 45.6 29:3 63.3 26.4 23.2 33.2 26.9 
Betacoronavirus subgroup 
D 
Ro-BatCoV HKU9 29,114 43.8 59.3 61.8 26.5 15.2 33 23.4 44.2 59.1 61.6 27 15.2 32.2 23.7 
Gammacoronavirus 
IBV 27,608 41.4 59.5 57.8 265 15.7 25 24 41.1 59.4 57.6 26.7 15.7 25.4 22.7 
BWCoV SW1 31,686 40.8 57.2 58.5 27.5 22.7 25.1 27.3 40.5 ov 58.4 268 22.7 24.6 26.9 
Deltacoroanvirus 
BuCoV HKU11 26,476 34.8 49.1 50.8 38 21.7, 26.9 21.2 34.8 49 50.7. 37.6 21.7 25.7 21.7 
ThCoV HKU12 26,396 34.8 48.6 50.7. 37.6 22.9 28 21.3 34.8 48.5 50.55 37.9 22.9 27.8 21.8 
MunCoV HKU13 26,552 34,2 49.4 50.9 38.3 24.1 25.1 19.8 34.2 49.3 50.7. 36.3 24.1 25.7 20.2 
PorCoV HKUI5 25,421 34.8 48.8 51.6 37.4 214 25.1 19.7 34.8 48.6 51.4 37.4 214 24.55 20.7 


* TGEV, porcine transmissible gastroenteritis virus; FIPV, feline infectious peritonitis virus; PRCV, porcine respiratory coronavirus; HCoV 229E, human coronavirus 229E; HCoV NL63, 
human coronavirus NL63; PEDV, porcine epidemic diarrhea virus; CCoV, canine coronavirus; MCoV, mink coronavirus; Rh-BatCoV HKU2, Rhinolophus bat coronavirus HKU2; Mi- 
BatCoV 1A, Miniopterus bat coronavirus 1A; Mi-BatCoV 1B, Miniopterus bat coronavirus 1B; Mi-BatCoV HKU8, Miniopterus bat coronavirus HKU8; Sc-BatCoV 512, Scotophilus bat 
coronavirus 512; Ro-BatCoV HKU10, Rousettus bat coronavirus HKU10; Hi-BatCoV HKU10, Hipposideros bat coronavirus HKU10; HCoV HKU1, human coronavirus HKU1; HCoV 
OC43, human coronavirus OC43; MHV, murine hepatitis virus; BCoV, bovine coronavirus; PHEV, porcine hemagglutinating encephalomyelitis virus; GiCoV, giraffe coronavirus; SARS- 
CoV, SARS coronavirus; SARSr-Rh-BatCoV HKU3, SARS-related Rhinolophus bat coronavirus HKU3; Ty-BatCoV HKU4, Tylonycteris bat coronavirus HKU4; Pi-BatCoV HKU5, 
Pipistrellus bat coronavirus HKU5; Ro-BatCoV HKU, Rousettus bat coronavirus HKU9; IBV, infectious bronchitis virus; BWCoV SW1, beluga whale coronavirus SW1; BuCoV HKU11, 
bulbul coronavirus HKU11; ThCoV HKU 12, thrush coronavirus HKU12; MunCoV HKU13, munia coronavirus HKU13; PorCoV HKU15, porcine coronavirus HKU15. 
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FIG 2 Phylogenetic analysis of RdRp, Hel, S, and N of Hi-BatCoV HKU10 and Ro-BatCoV HKU10. The trees were constructed by the maximum-likelihood 
method with bootstrap values calculated from 100 trees. A total of 951, 609, 1,899 and 572 amino acid positions in RdRp, Hel, S, and N, respectively, were 
included in the analysis. The scale bars indicate the estimated number of substitutions per 5 or 10 aa. HCoV 229E, human coronavirus 229E; PEDV, porcine 
epidemic diarrhea virus; TGEV, porcine transmissible gastroenteritis virus; FIPV, feline infectious peritonitis virus; PRCV, porcine respiratory coronavirus; 
HCoV NL63, human coronavirus NL63; Rh-BatCoV HKU2, Rhinolophus bat coronavirus HKU2; Mi-BatCoV 1A, Miniopterus bat coronavirus 1A; Mi-BatCoV 
1B, Miniopterus bat coronavirus 1B; Mi-BatCoV HKU8, Miniopterus bat coronavirus HKU8; Sc-BatCoV 512, Scotophilus bat coronavirus 512; HCoV HKU1, 
human coronavirus HKU1, HCoV OC43, human coronavirus OC43; MHV, murine hepatitis virus; BCoV, bovine coronavirus; PHEV, porcine hemagglutinat- 
ing encephalomyelitis virus; GiCoV, giraffe coronavirus; SARS-CoV, SARS coronavirus; SARSr-Rh-BatCoV HKU3, SARS-related Rhinolophus bat coronavirus 
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TABLE 5 Estimation of nonsynonymous substitution and synonymous 
rates in the genomes of Ro-BatCoV HKU10 and Hi-BatCoV HKU10 


K,/K, ratio 


Hi-BatCoV HKU10 Ro-BatCoV HKU10 


Gene (6 strains) (2 strains) 

NSP1 0.121 0.285 

NSP2 0.173 0.071 

NSP3 0.100 0.313 

NSP4 0.050 0.063 

NSP5 0.022 0.029 

NSP6 0.051 K, = 0, K, = 0.037 
NSP7 K, = 0, K, = 0.021 0.128 

NSP8 0.047 K, = 0, K, = 0.023 
NSP9 K, = 0, K, = 0.022 K, = 0, K, = 0.027 
NSP10 0.022 K, = 0, K, = 0.031 
NSP11 K, = 0, K,=0 K,=0,K,=0 
NSP12 0.063 0.061 

NSP13 0.012 K, = 0, K, = 0.043 
NSP 14 0.028 0.021 

NSP15 0.053 0.012 

NSP16 0.087 0.043 

S 0.2774 0.078" 

NS3 0.077 0.108 

E K, = 0, K,=0 K,=0,K,=0 

M 0.333 0.290 

N 0.235 0.077 

NS7a K, = 0, K,=0 0.151 

NS7b 1.000 0.740 

NS7c 0.435 0.575 


* The K,/K, of S sequences of four Hi-BatCoV HKU10 strains from 2005-2006 was 
2.000, that of four Hi-BatCoV HKU10 strains from 2007-2008 was 0.333, and that of 
two Hi-BatCoV HKU10 strains from 2010 was 0 (K, = 0, K, = 0.001). 

» The K,/K, of S sequences of three Ro-BatCoV HKU10 strains from 2005 was 0.070. 


site REL analysis of S-gene sequences showed that only the branch 
of two Hi-BatCoV HKU10 strains from 2010 was under signifi- 
cant positive selection (P = 0.003), with the strength of positive 
selection (w*) and the proportion of total branch length affected 
by positive selection (q*) being 4,378.93 and 0.03, respectively 
(Fig. 3A). This suggested that the S gene of Hi-BatCoV HKU10 
evolved under positive selection along the year 2010 lineage on 
short segments of the branch. REL analysis found that 66 of the 
1,351 codons in the S proteins of Hi-BatCoV HKU10 strains from 
2005 to 2010 were under positive selection. Most of these sites 
were distributed within the $1 domain, indicating that this do- 
main may have been under functional constraints (Fig. 3B). How- 
ever, since detection of specific amino acid sites under positive 
selection using REL is unstable in the presence of heterotachy, 
only the trends of spatial localization were indicated. 

Estimation of divergence dates. Using the relaxed clock model 
with Uced on RdRp gene sequences, tMRCA of all BatCoV 
HKU10 strains was estimated at 1959.34 (HPDs, 1886.23 to 
2002.77), approximately 53 years ago. The tMRCA of Hi-BatCoV 
HKU10 was estimated at 1986.88 (HPDs, 1956.17 to 2004.76) and 
that of Ro-BatCoV HKU 10 at 1991.58 (HPDs, 1968.62 to 2004.41) 
(Fig. 4). The estimated mean substitution rate of the RdRp data set 


was 3.705 X 10 * substitution per site per year, which is compa- 
rable to previous estimations for other CoVs (32, 35, 50, 64). 


DISCUSSION 


In this study, we detected and characterized a novel alphacoronavi- 
rus, BatCoV HKU10, from two very different bat species in China. 
Ro-BatCoV HKU10 was detected in three Leschenault’s rousettes in 
Guangdong Province, whereas Hi-BatCoV HKU10 was detected in 
37 Pomona leaf-nosed bats in Hong Kong. The genomes of Ro- 
BatCoV HKU10 and Hi-BatCoV HKU10 were highly similar except 
in the S region, where the two viruses shared only 60.5% amino acid 
identities. Nevertheless, they formed a distinct cluster within Alpha- 
coronavirus upon phylogenetic analysis, supporting the idea that 
BatCoV HKU10 represents a novel species. Since Ro-BatCoV 
HKU10 and Hi-BatCoV HKU10 have >90% amino acid identity in 
the seven conserved replicase domains for CoV species demarcation 
by ICTV (5), these two CoVs should be recognized as the same species 
infecting two different bat species. 

The marked difference between the S proteins of Ro-BatCoV 
HKU10 and Hi-BatCoV HKU10 despite the high similarity be- 
tween their genomes in other regions strongly suggested that they 
shared a recent common ancestor. Moreover, positive selection 
and molecular-clock analysis suggested that BatCoV HKU10 may 
have been transmitted to the new host, Pomona leaf-nosed bats, 
relatively recently. First the K,/K, ratio of the S gene of Hi-BatCoV 
HKU10 was higher than that of Ro-BatCoV HKU10, although the 
latter was detected only in Leschenault’s rousettes sampled in 
2005. Moreover, the drop in K,/K, ratio for S genes of Hi-BatCoV 
HKU10 from 2.000 among strains from 2005-2006 to 0 among 
strains from 2010 suggested that the S gene of Hi-BatCoV HKU10 
was under strong positive selection during 2005-2006, which was 
probably due to recent interspecies transmission and adaptation 
in the new host species, Pomona leaf-nosed bats. Second, signifi- 
cant positive selection was observed at the branch of two Hi-Bat- 
CoV HKU10 strains from 2010, with most of the codons under 
selection being distributed within the $1 domain. This suggested 
that these most recent strains have undergone further rapid evo- 
lution in their $1 domains, which may have favored the emergence 
of a novel subtype to adapt to new host and/or environmental 
factors. Third, molecular-clock analysis of the RdRp genes dated 
the tMRCA of all BatCoV HKU10 strains at around 1959 (HPDs, 
1886 to 2002) and that of Hi-BatCoV HKU10 at around 1986 
(HPDs, 1956 to 2004), which supported the recent emergence of 
BatCoV HKU10 and recent interspecies transmission to Pomona 
leaf-nosed bats. Based on the above evidence, it is likely that Bat- 
CoV HKU10 was transmitted to Pomona leaf-nosed bats not long 
before 2005, most probably from Leschenault’s rousettes, and the 
virus has been rapidly adapting in the new host by changing its S 
protein. However, as the number of bat samples, especially from 
Pomona leaf-nosed bats in Guangdong, was limited in this study, 
further studies on more samples and virus isolation in cell cultures 
derived from the two bat species may allow a more accurate de- 
termination of the directionality of interspecies transmission and 
exclude other possible explanations of the observed difference in S 
proteins, such as host selection driving rapid evolution. Moreover, 


HKU3; Ty-BatCoV HKU4, Tylonycteris bat coronavirus HKU4; Pi-BatCoV HKU5, Pipistrellus bat coronavirus HKU5; Ro-BatCoV HKU9, Rousettus bat 
coronavirus HKU9; IBV, infectious bronchitis virus; BWCoV SW1, beluga whale coronavirus SW1; BuCoV HKU11, bulbul coronavirus HKU11; ThCoV 
HKU12, thrush coronavirus HKU12; MunCoV HKU13, munia coronavirus HKU13; PorCoV HKU15, porcine coronavirus HKU15. 
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FIG 3 Selection pressure analysis of the S genes of BatCoV HKU10. (A) Detection of lineage-specific selection pressure. The branch with a P value of <0.01 is 
highlighted. w*, strength of positive selection; q*, proportion of the total branch length influenced by the selective pressure. The scale bar indicates the estimated 
number of substitutions per 20 nucleotides. (B) Distribution of positively selected sites in S protein genes identified using REL among Hi-BatCoV HKU 10 strains 
from 2005-2010. The receptor-binding domains (RBD) of the S proteins of TGEV, HCoV NL63, and HCoV 229E were mapped previously (3, 14, 74). Homology 
modeling of the RBD in the S proteins of Ro-BatCoV HKU10 and Hi-BatCoV HKU10 was performed using SwissModel in automated mode (52). The heptad 
repeat (HR) regions were predicted by using the coiled-coil prediction program MultiCoil2 (61). 


detection of more strains of Ro-BatCoV HKU10 from Le- 
schenault’s rousettes in the near future for evolutionary studies 
may help further confirm that these bats are the primary reservoir 
of BatCoV HKU10. The S proteins of CoVs are responsible for 
receptor binding and host species adaptation, and their genes 


November 2012 Volume 86 Number 21 


therefore constitute one of the most variable regions within CoV 
genomes (30, 31, 40). Previous studies on SARS-CoV have also 
provided clues on how changes in the CoV S protein, both within 
and outside the receptor-binding domain, may govern CoV cross- 
species transmission and emergence in new host populations (16, 
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FIG 4 Estimation of the tMRCA of BatCoV HKU10. The time-scaled phylogeny was summarized from all MCMC phylogenies of the RdRp gene data set 
analyzed under the relaxed-clock model with an exponential distribution (Uced) in BEAST version 1.6.2. Viruses characterized in this study are in bold. 


45). The present results also suggested that the CoV S protein is 
able to evolve rapidly within a short time after viral transmission 
to anew host, analogous to the situation of SARSr-CoV evolution 
(39, 54, 76). In fact, the sequence divergence between the S pro- 
teins of Ro-BatCoV HKU10 and Hi-BatCoV HKUI1O0 is even 
higher than that between SARSr-Rh-BatCoV and civet SARSr- 
CoV (30, 40), which in turn supported the idea that horseshoe bats 
could well be the reservoir for the direct ancestor of SARSr-CoVs 
in civets, with recent bat-to-civet transmission. Unfortunately, bat 
CoVs discovered so far cannot be cultivated in traditional in vitro 
cell lines, which has hampered studies on their receptor binding 
and host adaptation. 

The interspecies transmission of BatCoV HKU10 could well be 
explained by the biological characteristics of its host species. Bats 
(order Chiroptera), which account for about 20% of all mamma- 
lian species, are classified into two suborders, Microchiroptera 
and Megachiroptera. Pomona leaf-nosed bats are small, insectiv- 
orous bats belonging to the suborder Microchiroptera, family 
Hipposideridae, with a body weight ranging from 6 to 8 g. In Hong 
Kong, they are very common and widespread throughout coun- 
tryside areas and roost in colonies with up to several hundred 
individuals, mainly in water tunnels and abandoned mines or 
other enclosures with limited airflow. Interestingly, leaf-nosed 
bats belonging to Hipposideridae have also been found to harbor 
coronaviruses, including alphacoronaviruses closely related to 
HCoV 229E, with the most recent common ancestor of these al- 


11916 jvi.asm.org 


phacoronaviruses and HCoV 229E being dated to approximately 
1686-1800 (15, 46). In contrast, Leschenault’s rousettes are fruit 
bats belonging to the suborder Megachiroptera, family Pteropo- 
didae, with large body size, weighing 54 to 155 g and with a fore- 
arm length up to 88 mm (53). This bat species is widely distributed 
in Asia and roosts in extremely densely packed colonies of up to 
several thousand individuals. It is also well known for a very long 
flying distance, >11 km, and the ability to tolerate diverse and 
harsh habitats. These special biological features probably explain 
the ability of Leschenault’s rousettes to acquire various viruses as 
well as to transmit them to other bat species. Transmission of 
BatCoV HKU10 from Leschenault’s rousettes residing in Guang- 
dong to Pomona leaf-nosed bats in Hong Kong is possible, given 
that the two places are only about 140 km apart. According to 
survey records in Hong Kong, these two species have also been 
found to share roosting sites, which would allow indirect contact. 
Besides Ro-BatCoV HKU10, Leschenault’s rousettes from China 
have also been found to carry other viruses, including diverse ge- 
notypes of Ro-BatCoV HKU9, a subgroup D betacoronavirus, 
arising from recombination, as well as Tuhokovirus 1, 2, and 3, 
which are rubulaviruses belonging to the family Paramyxoviridae 
(33, 34). Although no evidence for recombination was observed 
among the present BatCoV HKU10 strains, coinfection of differ- 
ent CoVs in the same bat species may potentially create opportu- 
nities for recombination and emergence of new viruses. 
Although bats infected with BatCoV HKU10 appeared to be 


Journal of Virology 


AINN GNVTMVO Aq S102 ‘9 Iidy uo /Bio"wse"lAl//:djjy Wo. pepeojumoq 


healthy, lower body weights were observed in Pomona leaf-nosed 
bats positive for Hi-BatCoV HKU 10 than those negative for CoVs. 
This is similar to our previous findings that Chinese horseshoe 
bats infected with SARSr-Rh-BatCoV had lower body weights 
than those that were uninfected or infected with another CoV, 
Rh-BatCoV HKU2 (32). This supports the idea that certain bat 
CoVs may cause acute infection associated with weight loss in 
their host species. The fact that BatCoV HKU10 was detected 
mainly in alimentary samples also suggests an enteric tropism. 
However, further studies are required to understand the pathoge- 
nicity of BatCoV HKU10 in its host species. 

The present study not only provides the first evidence for in- 
terspecies transmission of a CoV between two very different bats 
belonging to different suborders but also illustrates the power of 
genome sequencing and analysis in understanding the evolution 
and ecology of CoVs. The present Hi-BatCoV HKU10 genomes 
also represented the first genome data available for CoVs in bats 
belonging to the genus Hipposideros. While the existence of CoVs 
in bats was unknown until after the SARS epidemic, different bat 
populations from various countries are now known to harbor 
diverse CoVs, likely as a result of their species diversity, roosting 
behaviors, and migrating abilities (30, 40, 49, 58, 67, 70). The 
present data also support the idea that these warm-blooded flying 
vertebrates are ideal hosts for the gene source for Alphacoronavirus 
and Betacoronavirus to fuel coronavirus evolution and dissemina- 
tion (73). Should recombination occur among these bat CoVs 
when bats are in proximity to other animals, such as in markets 
and restaurants in Guangdong (69), these animals could well be 
the source of new epidemics, like SARS. Bats are increasingly rec- 
ognized as reservoirs for various zoonotic viruses, including lys- 
savirus, rabies virus, Hendra virus, Nipah Ebola virus, and influ- 
enza virus (37, 60). Continuous studies of viruses from different 
bat species and their genome analysis would help us better under- 
stand the role of bats in the ecology and evolution of CoVs and 
other zoonotic viruses. 
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